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CREATING AND CONTROLLING OVERLAP IN TWO-LAYER 
NETWORKS. APPLICATION TO A MEAN-FIELD SIS 
EPIDEMIC MODEL WITH AWARENESS DISSEMINATION. 

DAVID JUHER AND JOAN SALDANA 


Abstract. We study the properties of the potential overlap between two net¬ 
works A, B sharing the same set of N nodes (a two-layer network) whose 
respective degree distributions Pa(^)^Pb(^) are given. Defining the overlap 
coefficient a as the Jaccard index, we prove that a is very close to 0 when A 
and B have been independently generated via the configuration model algo¬ 
rithm. We also derive an upper bound olm for the maximum overlap coefficient 
permitted in terms of PA{k), Pb{ &) an( l 2/. Then we present an algorithm 
based on cross-rewiring of links to obtain a two-layer network with any pre¬ 
scribed a inside the range (0, cum)- Finally, to illustrate the importance of 
the overlap for the dynamics of interacting contagious processes, we derive a 
mean-field model for the spread of an SIS epidemic with awareness against 
infection over a two-layer network, containing a as a parameter. A simple 
analytical relationship between a and the basic reproduction number follows. 
Stochastic simulations are presented to assess the accuracy of the upper bound 
ajvf an( l the predictions of the mean-field epidemic model. 


1. Introduction 

Some contagious processes interact with each other during their propagation, 
which can occur either through the same route of transmission or through routes 
that share the same set of nodes but use different types of connections. In the 
second case, the description of the spread uses the concept of multilayer or multi¬ 
plex network, namely, a set of nodes (individuals, computers, etc.) connected by 
qualitatively different types of links corresponding to possible relationships among 
them (acquaintanceship, friendship, physical contact, social networks, etc), each 
layer defined by a type of connection. Competitive viruses spreading simultane¬ 
ously through different routes of transmission over the same host population, or 
the spread of a pathogen and awareness during an epidemic episode are examples 
of processes that are better described by means of multilayer networks Elg¬ 
in the last years it has been a development of the mathematical formulation of 
multiplex networks and, also, of more general interconnected networks for which 
the set of nodes does not need to be the same at each layer HU El EE]. Moreover, 
recent results show the importance of the interrelation between different layers 
in determining the fate of competitive epidemic processes HIED]. In other cases, 
however, the importance of such an interrelation is not so evident from the analytical 
results of the epidemic threshold Ham], or even seems to be not relevant at all 

m- 

Only a few papers dealing with competing epidemics over multilayer networks 
focus on the impact of layer overlap on the epidemic dynamics HI HI HU- In 
m, the authors consider a sequential propagation of two epidemics using distinct 
routes of transmission over a network consisting of two partly overlapped layers. 
Using bond percolation, it is determined the success of a second epidemic through 
that part of its route of transmission whose nodes have not been infected by the 
first epidemic. In m, the authors develop an analytical approach to deal with 
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simultaneous spread of two interacting viral agents on two-layered networks. In 
this work, moreover, the respective effects of overlap and correlation of the degrees 
of nodes in each layer on the epidemic dynamics are considered. 

Here the overlap a between two (labeled) networks A and B of N nodes is 
defined as the fraction of links of the union network that are common links of A 
and B or, equivalently, the probability that a randomly chosen link of the network 
A U B is simultaneously a link of both A and B. In fact a is the Jaccard index 
as defined in [2]. Just to illustrate that this simple statistical parameter can play 
a critical role in the qualitative response of a two-layer network model, in Sect. [2] 
we present a mean-field model for the spread of two contagious process interacting 
each other, namely, the spread of an infectious agent and the raising in awareness 
of preventive behaviours. As an interesting feature, the overlap coefficient between 
the networks embedding the respective routes of transmission is a parameter of the 
model. This allows us to derive a simple relationship between a and the epidemic 
threshold. Provided that one wants to perform simulations to validate this (or any) 
model, a systematic procedure to generate couples of networks of given size and 
degree distributions with a prescribed value of a would be a useful tool. 

However, the following natural question arises: Given respective degree distribu¬ 
tions pA(k) andpsik) for each network layer, which is the range of attainable over¬ 
lap coefficients between them? In previous papers dealing with this issue |141121j . 
a joint degree distribution p{k\,k 2 ,kf) is considered to generate a two-layer net¬ 
work with arbitrary overlap by decomposing it into three non-overlapped networks. 
The third marginal degree distribution is the one for the overlapped part of the 
two layers, whereas the other two correspond to the non-overlapped parts of each 
layer. Therefore, the probability that a randomly selected node has degree k\ on 
the first layer and degree &2 on the second one is given by the joint degree distribu¬ 
tion P(ki, £ 2 ) obtained from p as P{k\,k 2 ) = Ylk 3 /°(^i ~ ^ 3,^2 — ks, £ 3 ). In other 
words, the overlap between both layers is prescribed before hand by p{k\, £ 2 , kf). In 
contrast, our approach is based on the study of the potential overlap between two 
networks whose (finite, empirical) degree distributions are previously fixed. More 
precisely, in Sect.[3]and[4]we estimate the minimum and maximum values (call them 
a m and ckm) for the overlap coefficient between two networks of size N and degree 
distributions PA(k) and ps(fc). In Sect. [5] we present an algorithm that takes as 
input ./V, pA(k ), ps(fc) and a £ (a m , oim), and generates a couple of networks of N 
nodes, with respective degree distributions PA{k) and ps(fc) and overlap coefficient 
close to a. So we are given a tool to test the analytical predictions relating overlap 
and epidemic thresholds. Finally, in Sect. [ 6 ] we assess the accuracy of the predic¬ 
tions of the mean-field formulation by comparing them to stochastic simulations of 
the contagious processes over complex random networks. 

2. Motivation of the problem: a mean-field SIS epidemic model 

DEFINED ON A TWO-LAYER NETWORK 

We start this section by fixing some terminology. All along this paper, the 
nodes of any network will be labeled with the natural numbers {1,2,...,IV}. The 
cardinality of a finite set X will be denoted by |Aj. Let V = {1,2,..., N} for some 
N £ N. Let E and E' be two subsets of {{*, j} : i j and i,j £ V}. Let G and 
G' be the undirected networks having V as the set of nodes and E and E' as the 
respective sets of links. The union network GUG' is the undirected network whose 
sets of nodes and links are V and E U E' respectively. By definition, we will say 
that G and G 1 are different from each other if and only if E ^ E'. In particular, if 
we have a network H and we simply permute the labels of the nodes of H , then we 
obtain a network that is in general different from (but isomorphic to) H. Observe 
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that the union operation is not a topological invariant: the union of two networks 
does not depend only on their shapes but also on the way their nodes are labeled. 
The overlap between G and G' is defined as the fraction 

OvlG G') = lEnE ' 1 = lEnE 

V \EUE'\ \E\ + \E’\-\EnE'\' 

which can be thought as the probability that a randomly chosen link of G U G’ is 
simultaneously a link of both G and G'. 

A degree set of cardinality TV is a multiset (i.e. multiple instances of each element 
are allowed) of N integers that is realizable as the set of degrees of a network. 
That is, there exist a labeling {k\, fo,..., fcjv} of the elements of the set and a 
network G of N nodes such that hi is the degree of the node i. The ordered list 
(fei, & 2 j • • ■ i Ajv) will be called the degree sequence of G. A probability distribution 
p{k) with bounded support will be called empirical (of N nodes) if it is realizable 
as the degree distribution of a network of N nodes. That is, there exists a network 
G of IV nodes such that: 

(51) The degree set {ki, & 2 ,..., /cat} of G satisfies the well-known Havel-Hakimi 
condition Hang 

(52) N k := \{i : ki = k}\ = p(k)N 

(53) ki =: 2 L is even 

(54) If (k) denotes the expected degree of a node, then ( k)N = 2 L. 

We use the term empirical for a degree distribution to distinguish it from a (the¬ 
oretical, not necessarily with bounded support) probability distribution p{k). In 
this case, for any N £ N, one can use several standard algorithms (see Sect. [3]) 
to construct a network Gat of IV nodes whose empirical degree distribution pN(k) 
is close to p(k), in the sense that, for big enough values of N, pN{k) converges in 
probability to p(k) (0, Theorem 2.1). 

2.1. The model. Epidemic models describe the spread of infectious diseases on 
populations whose individuals are classified into distinct classes according to their 
infection state as, for instance, the class of susceptible (S) individuals and the class 
of infectious (I) ones. A closer look at the physical transmission of an infection re¬ 
veals that a suitable description of populations must take into account the network 
A of physical contacts among individuals, with nodes representing individuals and 
links corresponding to physical contacts along which disease can propagate. On the 
other hand, if one assumes that the probability of getting infected through an infec¬ 
tious contact S-I depends on the awareness state of the susceptible individual, then 
a second network B over which information about the infection state of individuals 
circulates can be considered. This dissemination network shares the same set of 
nodes with the one of physical contacts, but has a different set of links representing, 
for instance, relationships with friends and acquaintances. So, if a pair of individ¬ 
uals, one susceptible and the other infectious, are connected to each other on both 
networks, one can assume that the transmission rate /3 C (here c stands for common) 
will be smaller than the normal transmission rate j3. This is because susceptible 
individuals have information about the health state of their infected partners and 
react by adopting preventive measures to diminish the risk of contagion. 

According to this scenario, next we derive a mean-field susceptible-infectious- 
susceptible (SIS) epidemic model which implicitly assumes spreading of both infor¬ 
mation and an infectious agent over a two-layer network. Following the standard 
approach for sexually transmitted diseases (STDs) where the heterogeneity in the 
number of contacts (sexual partners) is a basic ingredient [ 1 ] , individuals are classi¬ 
fied according to their infection state and their number of physical contacts. So, the 
model will take into account the network layer A of physical contacts in terms of its 
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degree distribution pA(k) = Nk/N where Nk is the number of individuals having 
degree k. Analogously, the dissemination network (network layer B ) is described 
by its degree distribution ps(fc). A key assumption in the model derivation is the 
existence of a partial and uniform overlap between the links of each layer, which 
means that the probability of finding two nodes connected to each other in both 
networks does not depend on the degrees of the pair. For sake of brevity, a pair of 
such nodes is said to share a common link , although the natures of the connections 
are dissimilar. 

Within each layer, it is assumed that there is no degree-degree correlation, i.e., 
neighbours in each layer are randomly sampled from the population according to the 
so-called proportionate mixing of individuals uni. This means that, in each layer, 
the probability P[k'\k) that a node of degree k is connected to a node of degree 
k! is independent of the degree k and it is given by the fraction of links pointing 
to nodes of degree k', i.e., P{k'\k) = k'p{k')/{k) [TJ3- Therefore, the expected 
degree of a node reached by following a randomly chosen link, i.e., the expected 
degree of a neighbour in a population with proportionate mixing is (k 2 )/(k). On 
the other hand, let Ik be the number of infectious nodes of degree k in network 
layer A. Although the links are unordered pairs of connected nodes by definition, 
let us consider that every link {u, r;} gives rise to two oriented links u —> v and 
v —y u. Then, the probability that a randomly chosen oriented link of A leads to an 
infectious node is given by the fraction of oriented links in A pointing to infectious 
nodes, that is, 



where (Ia) is the average degree in A , and ik := Ik/N is the fraction of nodes that 
are both infectious and of degree k in A. 

Finally, let La, Lb, and Lahb denote the number of links of A, B, and common 
links, respectively. Let Pb\a be the probability that a randomly chosen link of A, 
an A-link, connects two nodes that are also connected in B, that is, Pb\a = ■ 

Similarly, pa\b = L t n B B the P r °bability that a randomly chosen H-link is a 
common link to both networks. With all these quantities, the epidemic spreading is 
described in terms of the following differential equation for the number of infectious 
nodes of degree k in layer A: 



(1) 


with Sk = Nk — Ik being the number of susceptible nodes of degree k in layer A. 
Here /3 is the transmission rate through a non-common infectious link, and /3 C is 
the transmission rate through a common infectious link. 

The first term in the rhs of m is the rate of creation of new infectious nodes 
of degree k in A due to transmissions of the infection through links that only 
belong to layer A, whereas the second term is the rate of creation of new infectious 
nodes from transmissions across common links. The last term accounts for the 
recoveries of infectious nodes, which occur at a recovery rate p. Here (/ca)pb|a is 
the expected number of common oriented links. Therefore, since this number is 
the same regardless the network we use to compute it, the following consistency 
relationship must follow: 


(kA)PB\A = (kB)PA\B■ 


( 2 ) 


Now let us express Pb\a and Pa\b i n terms of the overlap a := Ov(A, B ), which 
is defined as a = k- AnB where Laub is the set of links of the union network A\J B. 

J-'AUB 
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Using (S4), Pb\a can be expressed in terms of a as follows: 


Pb\a — 


Ladb 


Laub Laub La + Lb — Laub 

= a ---= a 


La Laub La La 

From this simple relationship it immediately follows that 


= a 1 + 


( k B) 
( k A) 


- Pb\a 


Pb\a = 1 + 


(fcs)\ a 

(k A ) J 1 + a' 


( 3 ) 


Similarly, we also have that pa\b = f 1 + 77 — r ) ———■ Note that, as expected, 

V (kb) / 1 + a 

Pb\a an< 3 Pa\b fulfil relationship ©• 

Introducing (J3J) into Eq. ©. the overlap appears as a new parameter of the 
model which now, in terms of the fraction if. of nodes that are both infectious and 
of degree k, reads 

dik k ( a (, (k B ) , a , (fcflA 


dt 


1 + a 




1 + 


( k A ) J 


a) {p A (k) - ik)Qi - ph- (4) 


This equation corresponds to the standard SIS model for heterogeneous populations 
with proportionate mixing, but with an averaged transmission rate which depends 
on a. 

Simple facts about this equation are: 

(1) By Lemma f4.ll an upper bound for the maximum overlap coefficient is given 
by min{(/c^), (ks)}/ max{(/c^), (fcs)}. Since the factor a/(l + a) in © is 
increasing in a, when (fc^) < (ks) we get Pb\a < 1 while, for (kA) > (kB), 
we get p B \A < (k B )/(kA) < 1- 

(2) If (3 C = (3 or a = 0, Eq. d4]) reduces to the classic SIS-model, as expected, 
because information dissemination plays no role in the infection spread. 
If a = 1, we actually have one network and again Eq. © reduces to the 
SIS-model but now with /3 replaced by /3 C . 

To determine the impact of the network overlap on the initial epidemic growth, 
we linearise a about the disease-free equilibrium = 0\/k and obtain that the 
elements of the Jacobian matrix J* evaluated at this equilibrium are 

J*kk' = TTTkk'pAik ) - p5 kk ' 

(«a) 

where /3 0 (a) := (/? (l - +0 c (l + a) / (1 + a) and 5 kk ' is the Kro- 

necker delta. Since the dominant eigenvalue of the matrix ( kk'pA(k )) is equal to 
(fc^) = J2k k 2 PA(k) (with an associated eigenvector whose components v k are pro¬ 
portional to kpA(k)), it follows that the dominant eigenvalue of J* is 


Ai 


M> 

(kA) 


A) (a) — /b 


which corresponds to the initial growth rate of the epidemic (cf. [I] [22] for a = 0). 
From this expression we get that Ai decreases with a when /3 C < /3. 

We can also measure the initial epidemic growth in terms of the basic repro¬ 
duction number Rq , i.e., the average number of secondary infections caused by a 
typical infectious individual at the beginning of an epidemic in a wholly susceptible 
population m- Interpreting /3o(a) as an averaged transmission rate weighted by 
the overlap coefficient a and recalling that (k\) /^a) is the expected degree of a 
neighbour in a population with proportionate mixing, Rq is given by 


(k\) A)(oQ 

(k A ) p 


(k\) 

(k A )(l+a)p\ P \ (k A ) J 
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Therefore, as expected from the expression of Ai, Rq is a decreasing function of 
the overlap coefficient between the two layers as long as /3 C < j3. Note that this 
expression of Rq is a straightforward extension of the one obtained in |L for hetero¬ 
geneous populations and STDs. Figure [l] shows this relationship when layer A has, 
for instance, an exponential degree distribution with minimum degree /c m i n = 10. 
For this distribution, (k^) = 2fc m j n and (fc^)/(fc^) = 5/2 ■ fc m j n which amount to 
the values used in the figure. 



Figure 1. Graph of Ro of the SIS model as a function of the 
overlap coefficient a. Parameters: p = 1, 0 = 0.1, /3 C = 0.005, 

(/ca) = 20, (k\) = 500, and (ks) = 26. For these mean degrees, 
a G [0,10/13] by Lemma ITT! 

As usual, it would be desirable to test the accuracy of the model by collating the 
numerical integration of equations Q with the output of stochastic simulations. 
Note that in the derivation of (U) we have assumed the statistical uniformity of 
several network features. In particular, observe that the entire degree distribution 
Ps{k) of the dissemination network plays no role in the equations (this is not the 
case for pA{k)). In fact, the role of layer B is reduced to its mean degree (ks) via 
the term Pb\a- In consequence, it makes sense to perform stochastic simulations 
with a number of different topologies for A and B, in order to evaluate in which 
situations the mean-field nature of the model fails in giving accurate predictions 
for the epidemic progression. On the other hand, we are mainly interested in the 
overlap a as the critical parameter of the model. So, once the empirical degree 
distributions PA{k) and ps(fc) are decided, we aim at performing simulations for 
several values of a. Taking it all into account, the following natural questions 
arise. First, which is the possible range of permitted overlaps between any couple 
of networks A , B with previously fixed size N and empirical degree distributions 
PA(k) and ps(fc)? Second, given a value of a inside this range, it is possible to design 
an algorithm to construct two networks A and B whose degrees are respectively 
distributed according to PA(k) and ps(fc) with the prescribed overlap a! Both 
issues are discussed in the following sections. 

3. The expected overlap between two random independent layers 

Assume that we are given two empirical degree distributions p(k),p'(k) of N 
nodes, with corresponding degree sets K and K'. Let n and n' be the total number 
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of pairwise different networks having respectively K and K' as degree set, each 
one numbered with an integer in the range [l,n] (respectively, [ 1 , 71 ']). Then we 
can clearly consider a function of two variables Ov(x, y ) on the grid of all pairs 
(x,y) of integers in [l,n] x [l,n'], that gives the value of the overlap of the net¬ 
works numbered as x and y. Observe that the function Ov(x, y) has a global 
minimum/maximum. These extremal values w r ill be denoted by MinOv(A, K') and 
MaxO v(K,K'), or by MinOvAr(p,p') and MaxOv/v(p,p'). The problem of find¬ 
ing or estimating MinOvjvO^f/) and MaxOvjv(p,p') naturally arises. Note that a 
brute force algorithm to compute them by exploring Ov(x, y) for all (a;, y) £ R is 
not feasible, since n and n! are of order N\. In this section we give an upper bound 
for MinOvjv(p,p / ) in terms of the size N and the degree distributions p{k),p'{k). 
The analogous problem for MaxO vn(p,p') will be the matter of Sect. [2 

We need to recall the standard configuration model algorithm [313121] to gen¬ 
erate a random network with a given degree distribution and size. We will use the 
following fast and efficient version of the algorithm. Let A be a degree set and let 
(fci, & 2 , • • ■, fcjv) be any degree sequence obtained by labeling the elements of K. In 
particular, 2 L := )T) ki is even. Now take a vector X of length 2 L containing ki 
times the integer 1 in the first k\ entries, /C 2 times the integer 2 in the following &2 
entries, etc. Each entry v of X represents a single stub (or semi-link) attached at the 
node labeled as v. Then, take a random permutation of the entries of X to get a new 
array Y. Finally, read the contents of Y in order, interpreting each pair of consecu¬ 
tive entries v, w as a link between the nodes v and w. For an example, take N = 6 
and consider the degree distribution p(k) defined by p( 1) = p( 3) = 1/6, p( 2) = 4/6 
and p(k) = 0 for k 1, 2, 3. The corresponding degree set is {1, 2, 2, 2, 2,3}. Take 
(1, 2, 2, 2, 2, 3) as degree sequence. Then, X = (1, 2,2, 3,3,4,4,5, 5, 6,6, 6). Now 
we permute X at random, obtaining Y = (3,4, 5,1, 6,3, 6, 2,4, 5, 2,6). The set of 
links of the obtained network is {{3,4}, {5,1}, (6,3}, {6, 2}, {4, 5}, {2,6}}. Observe 
that the link {6, 2} appears twice. In general, the configuration model algorithm 
gives multigraphs rather than graphs. It is well known, however, that the fraction 
of self-loops and multi-links over the total number of links goes to 0 when N —» 00 

m- 

It seems natural to expect that the overlap between two networks of respective 
degree distributions p{k),p'(k) and size N generated via the configuration model 
algorithm is very small. When the respective mean degrees are small with respect 
to the total size N this turns out to be true. To prove this fact, we need to estimate 
the probability that two given nodes are connected in a random network generated 
via the configuration model algorithm. So, let G be a network of N nodes, L 
links and degree distribution p{k). Assume that G has been obtained by means of 
the configuration model algorithm starting with a degree sequence (k±, k^, ■ ■ ■, fc/v). 
Take at random any pair {i,j} of nodes with ki < kj. Next we estimate the 
probability pij that the network G contains the link {z, J} - This probability is 
given by the quotient a/b , where b is the total number of rearrangements Y of 
the vector X (here we are using the notation introduced in the definition of the 
configuration model) and a is the number of such rearrangements having at least 
two consecutive entries i , j (or j. i) in places Y n , Y n+ 1 for n = 1,3, 5,..., 2 L — 1. 
We have that 

‘“infer ‘ 5 > 

Let us compute a. For l = 1,2let Y l be the set of rearrangements Y 
containing the entries i, j (or j, i) in places Y^-i, Y 2 /. Then, a = (F 1 UF 2 U.. .UY L \. 
By the inclusion-exclusion principle, a = ai — + •.. + (—l) fci ^ 1 afc i , where a; is 

the sum of the cardinalities of all intersections of l sets in Y 1 , Y 2 , ..., Y L . A simple 
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combinatorial argument yields that 

( l ,)2 1 (2L-21)\ 

n . = _ Vi/ v _'_ for l < k 

k\\k 2 \ ■ ■ ■ fcj_i!(fci — Z)!/cj + i!- ■ • kj-i\(kj - l)\kj\■ ■ ■ /cjv! - 

while ai = 0 for ki < l < L. Using the previous expression and the inclusion- 
exclusion principle we get that 


ki 


kx\k 2 \ ■ ■ ■ k i ^ 1 \k i+1 l ■ ■ ■ kj-il(k j+ i)\ ■ 




(- 1 ) 


l-l 


L\ i (2L — 2l)\ 
l) {h - l)\{kj - l)\' 


Taking it all into account, we get the following result. 


Theorem 3.1. Let G be a random network of L links and N nodes with degree se¬ 
quence (Jfei, k 2 , • • •, kj\r), generated via the configuration model algorithm. Let {i,j} 
be any pair of nodes with ki < kj. Then, the probability that G contains the link 
{i,j} is 

/■!/,•,!/.;,! , (2L-2Q! 

^ (2 L)\ ^ ] 1\{L — l)\(ki — l)\{kj — 1)1 ’ 


The expression given by Theorem 13.II is too complex to be used to estimate the 
expected overlap between two random networks. Instead, we will use the following 
standard approximation for the probability pij |121 125] : 


Pij 


kikj 
2L — 1 


( 6 ) 


This formula can be obtained from the proof of Theorem 13.11 after replacing a 
simply by a± (here we are using the notation of the proof). The approximation 
d6]) is good enough only when ki and kj are small with respect to L , in particular 
when we consider networks with bounded mean degree and large size N, which is 
the case for most modeling applications. However, in general ([6]) can significantly 
differ from the exact formula given by Theorem 13.II 

Using the approximation ([6]) we show that the expected overlap between two 
random networks generated via the configuration model is very small, regardless of 
the particular distributions p{k),p'{k ), as the next result states. 


Theorem 3.2. Let p{k ), p' (fc) be two degree distributions with respective means (k) 
and (k'). Let G,G' be two networks of N nodes and degree distributions p{k) and 
p'{k ) generated via the configuration model algorithm. Assume that N is big enough 
with respect to ( k) and ( k') in such a way that the approximation ([6]) holds. Then, 
the expected overlap between G and G' can be approximated by 


O v(G, G') 


(k)(k f ) 

N{(k) + (k 1 )) — {k)(k') ’ 


Proof. Let L, L' be the number of links of G and G' respectively. Assume that G 
and G' have been generated via the configuration model algorithm starting with 
respective degree sequences (fci, k 2 ,..., fcjv) and {k[, k ' 2 ,..., k' N ). Using the approx¬ 
imation © we can compute the probability p that two different nodes chosen at 
random are neighbors in G: 


P 


1 

2L- 1 


y" kip{ki)kjp{kj) 

ki,kj 


(k) 2 

2L-1 


2 L N ’ 


(7) 


where in the last expression we have used (S4). Now the expected overlap between 
G and G' can be computed as the probability that two different nodes are connected 












CREATING AND CONTROLLING OVERLAP 


9 


in both G and G' over the probability that they are connected in GUG' which, by 
virtue of 0, is 

{k)(k')/N 2 


1 - 1 - 


W 

N 

□ 


1 - 


N 


□ 


Theorem CS2 tells us that given N and any two degree distributions p(k),p'(k), 
the minimum overlap MinOvjv(p,l0 is very close to 0, at least when N is big with 
respect to the expected values (k) and (k'). Of course, for small networks this is 
not true in general. 


4. An UPPER BOUND FOR THE MAXIMUM OVERLAP 


In this section we give an upper bound for MaxOvjv(p,p / ) in terms of the size 
N and the degree distributions p(k),p'(k). 

Let G, G' be two networks of N nodes and empirical degree distributions p(k),p'(k), 
with means (k) and (k 1 ). Let L and L' be the number of links of G and G'. If E 
and E' are the sets of links of G and G', then by definition 


Ov(G, G') 


\EC\E'\ 

\EUE’\ 


\E^E'\ 

L + L' - \EnE'\ 


x 

m + (k')) f 


=: F{*), 


( 8 ) 


where x stands for \EC\E'\ and in the last equality we have used (S4). Now observe 
that F{x) is increasing as a function of x. Finally, note that x cannot be larger than 
min{ L,L'}. Assume without loss of generality that L < L'. So, an upper bound 
for the maximum overlap permitted between G and G' is obtained when replacing 
x by L = ( k)N/2 in the previous expression, leading to 


°v( G ,G')<§. 

So, we have proved the following result. 


Lemma 4.1. Let p(k ), p'(k) be two empirical degree distributions of N nodes with 
respective means (k) and ( k'). Then, 


MaxOvN(p,p') < 


min{(fc), (k r )} 
max{(fc), (k 1 )} 


The upper bound in Lemma [4.11 is too crude in general. In particular, one can 
have two completely different degree distributions with the same expected values. In 
this situation, at least intuitively, there are important restrictions for the maximum 
value of the overlap, while the upper bound in Lemma 14. II is 1. Let us see how to 
improve it. 

Assume that we are given two degree sequences D = (k\ , ..., fciv) and D' = 

(, k[ , k' 2 , ■ ■ ■, fcjv), with = {k)N =: 2 L and — (k')N =: 2 L'. Since F{x) in 
© is increasing in x, an upper bound for the overlap is obtained when replacing x 
by the maximum possible number of links of the intersection network. In the proof 
of Lemma ITTl this maximum was taken to be min {L,L'}. To get a much better 
estimate, look at a particular position 1 < i < N of the degree sequences. It is clear 
that the intersection network cannot have more than min{fci, fc'} links attached at 
node i. In consequence, the total number of links of the intersection network is at 
most 

1 N 

L(D,D') := -^min{/ci,/c'}. 

i= 1 

The previous constant depends on the degree sequences D and D'. Of course, re¬ 
ordering the elements of D and D' by means of permutations cr, r we get two degree 
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sequences a {D) 1 t(D') representing two networks with the same degree distribution. 
In consequence, we have the following result. 


Theorem 4.2. Let p(k ), p'(k) be two empirical degree distributions of N nodes and 
respective degree sets K, K'. Let Ln(p,p') := max{L(.D, D')}, where the maximum 
is taken over all pairs D,D' of degree sequences obtained rearranging the elements 
of K and K' respectively. Then, 


MaxOvj v(p,p') < 


_ Ln{p,p’) _ 

((k) + (k '))f - L N (p,p') 


It is not easy to give a closed formula for Ln(jj,p') in terms of N, p{k) and 
p'{k). Alternatively, one could compute L(D,D') for all possible pairs D,D' and 
select the maximum. This brute force algorithm is not feasible since the number 
of operations is about N\. Fortunately, there exists an alternative and very fast 
algorithm to compute Lj^{jp, p') that relies on the following simple lemma. 


Lemma 4.3. Let (ki, ^ 2 ,..., fcjv) and (k[, k 2 ,..., k' N ) be two sequences of nonneg¬ 
ative numbers such that k\ < &2 < ... < ftjv . If there exists a pair of indices i < j 
such that k[ > fc', then 

min{/ei, k[} + minjfcj, kj} < minjfci, fc'} + minlfcj, k[}. 

Proof. Since fc,; < kj and k[ > kj, there are 6 cases to be considered to test the 
prescribed inequality: 

• kj < k[ < ki < kj 

• kj < ki < k[ < kj 

• k'j < ki < kj < k[ 

• ki < kj <k[< kj 

• ki < kj < kj < k[ 

• ki < kj < k'j < k[. 

It is trivial to check that the lemma holds in each case. □ □ 


As a consequence of Lemma [mi and Theorem 14.21 we get the following result. 


Theorem 4.4. Let p{k),p'[k) be two empirical degree distributions of N nodes. 
Let D = (fci, & 2 ,..., fcjv) and D' = (k[, k ' 2 ,..., k' N ) be the degree sequences obtained 
by ordering increasingly the respective degree sets. Then, 

N 

^ min {A,, k'} 

MaxO v N {p,p') < ^ -. 

maxjfcj, k[} 


Proof. Lemma fTol states that if S , S' are degree sequences fitting to p(k) and p'(k) 
such that S is increasingly ordered and there is a pair of entries s' > s' of S' with 
i < j, then if we swap both entries the obtained sequence S" satisfies L(S, S') < 
L(S, S"). Therefore, the maximum Ljsr(p,p') := max{L(5, 5")} is attained precisely 
in L(D, D'). Since, by definition, L(D, D') = (1/2) J] min{fcj, fc'}, Thcorem l4.2l and 
(S4) yield 


MaxOvAi(p,p') < 


_(1 / 2 )E i min {fcp^}_ 

( 1 / 2 )(E, ki + k[) - (1/2) Ei min{fcj, fc'} 


Since ki + k[ = maxjfci, fc'} + minj/cj, fc'}, the theorem follows. □ □ 
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Regular 

Poisson 

SF 

Exponential 

Regular 

0.7693 

0.7508 

0.6301 

0.6654 

Poisson 

0.7552 

0.7709 

0.7221 

0.7739 

SF 

0.5451 

0.6000 

0.7688 

0.7023 

Exponential 

0.6330 

0.7077 

0.7715 

0.7706 


Table 1. Upper bounds for the maximum overlap permitted be¬ 
tween pairs of empirical distributions according to Theorem l4.4l In 
all cases, N = 10000. For the left column distributions, (k) = 20 
while, for the upper ones, (k) = 26. 


Theorem 14.41 allows us to design an efficient algorithm to compute an upper 
bound for the maximum overlap. The algorithm takes as input the empirical dis¬ 
tributions p(k ) and p'{k). Sort increasingly the elements of the respective degree 
sets to get sequences D = (Aq, fc 2 ,..., fcjv) and D' = (k[, k' 2 , ..., k' N ). Finally, 
return min{fcj, fc'}/ Y2 maxjfci, fc'}. In Table |T| we show the output of this algo¬ 
rithm for several pairs of empirical degree distributions, obtained via the configu¬ 
ration model from a corresponding pair of (theoretical) distributions. In all cases, 
N = 10000. Here ”SF” stands for a scale-free network with p{k) = Ck~ J with 
7 = 3, minimum degree to, cut-off k c = mlV 1 / 2 , and the normalization constant 
G, for which (k) « 2 m (see Sect. [6] for details). ’’Exponential” corresponds to 
p(k) = (1 /m)e 1 ~ k ^ rn with minimum degree to, for which (k) = 2m. ’’Poisson” 
corresponds to p{k) = A e~ x /k\ with A = (fc), and ’’Regular” stands for a random 
network for which all nodes have the same degree. 

5. An algorithm to get a prescribed overlap 

Assume that we have generated two random networks G(0),G'(0) of N nodes 
using the configuration model. Let p(k), p'(k) be the corresponding empirical degree 
distributions. This section aims at designing an efficient algorithm to construct two 
networks G, G' of N nodes with respective degree distributions p(k) and p'{k) in 
such a way that Ov(G, G') ~ a , for any given MinOvjv(p,p') < a < MaxOvAr(p,p'). 
Taking into account that, in view of Theorem 13.21 Ov(G(0), G'(0)) ~ 0, it seems 
natural to propose an algorithm that works as follows. At each time step t > 0, 
modify the networks G(t), G r (t) a little bit by performing a local operation (an 
operation involving few nodes and/or links) to obtain new networks G(t + 1), G'(t + 
1) with empirical degree distributions p(k),p'(k) in such a way that Ov(G(t + 
l),G'(f + 1)) is slightly larger than Ov(G(t), G'(t)). Repeat until the overlap is 
close to a. 

The kind of local operation that we will use in the scheme above is a cross 
rewiring operation [32], according to the following definition. Let G(f),G'(t) be 
two networks of N nodes. A good pair in G(t) with respect to G'(t) is a pair of links 
{a, b}, {c,d} in G(t) satisfying the following conditions: 

(1) {a, b} and (c, d} are not links in G'(t) 

(2) (a, c} and (6, d} are not links in G(t ) 

(3) (a, c} is a link in G'(t). 

Analogously we define a good pair in G’(t) with respect to G(t ) by interchanging the 
roles of G(t) and G'(t) in the previous definition. Given a good pair {a, 6}, (c, d} 
in G{t) with respect to G'(t ), the associated cross-rewiring operation consists of 
replacing the links {a, b} and {c, d} in G(t ) by {a, c} and {6, d} to get a new network 
G(f + 1). Observe that G(t ) and G(t + 1) are in general different as non-labelled 
networks. However, the degrees of the involved nodes a, 6, c, d are not modified 
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Regular 

Poisson 

SF 

Exponential 

Regular 

0.00051 

0.00051 

0.000469 

0.000509 

0.000458 

0.000478 

0.000478 

0.000487 

Poisson 


0.000599 

0.000789 

0.000383 

0.000836 

0.000503 

0.000807 

SF 



0.000565 

0.002548 

0.000652 

0.001671 

Exponential 




0.000681 

0.001755 


Table 2. The overlap between two random networks before and 
after relabeling the nodes increasingly with the degree. In all cases, 
N = 10000 and (k) = 10. 


after performing the cross-rewiring. In consequence, G(t) and G(t + 1) have the 
same degree distribution. On the other hand, set G'(t + 1) = G'(t) and let E(t), 
E(t + 1), E'(t), E'(t + 1) be respectively the sets of links of G(t), G(t + 1), G'(t), 
G'(t + 1). Then, \E'(t + 1)| = \E'(t)\ and, by the definition of the cross rewiring 
operation over a good pair, | E(t + 1)| = \E(t)\. Moreover, by the definition of a 
good pair, either | E(t + 1) fl E'(t + 1)| = | E(t) H E'(t) | + 1 if {b, d} is a link in 
G'(t) or | E(t + 1) fl E'(t + 1)| = \E(t) 0 E'{t) + 2 otherwise. Then, if we denote 
Ov(G(f), G'(t )) and Ov(G(f + 1 ), G'(t + 1)) by Ov(f) and Ov(f + 1) respectively, a 
trivial computation yields that 


O v(£ + 1) = Ov(t) + 


x Ov(£) 2 + 2x Ov(f) + x 
L — x — x Ov(t) 


(9) 


where x G {1,2} and L = \E(t)\ + \E'(t)\. In other words, the overlap after 
performing a cross rewiring operation in a good pair of links slightly (but strictly) 
increases. 

From now on, let MinOvjv(p,p0 < & < MaxOvAr(p,p') be the desired overlap 
coefficient. In view of what has been said, the following algorithm seems natural. 
Use the configuration model to construct two random networks G(0),G'(0) of size 
N and degree distributions p(k),p'(k). The expected overlap is close to 0. Now, at 
each time step t > 0, choose at random (if it exists) a good pair of links in G(t) 
with respect to G'(t). Perform a cross rewiring operation in G(t) using such a pair, 
obtaining a new network G(t+ 1). SetG'(£+l) :=G'(t). Then, Ov(G(f +1), G'(t + 
1)) > Ov(G(f), G'(f)) by ©. If Ov(G(£ + 1), G'(t + 1)) > a, set G := G(t + 1), 
G' := G'(t + 1) and stop. Otherwise, proceed to the next time step. 

A serious objection can be raised against the above algorithm as stated: there is 
no reason to expect that proceeding in this way we can reach values of the overlap 
close to MaxOvjv(p,p'). It may well be that no more good pairs can be found to 
be rewired, long before reaching the desired overlap a. To overcome this problem, 
we turn back to the proof of Theorem 14.41 the number of common links containing 
a given node i cannot be larger than min{fcj,fc'}, where ki and are the degrees 
of node i in the respective networks. According to the proof of Theorem 14.41 to 
maximize the number of possible common links that will be obtained by performing 
a sequence of cross rewiring operations, it is enough to relabel the nodes increasingly 
with the degree. However, in doing this, we should make sure that the overlap 
between the original random networks does not change significantly (and remains, 
in consequence, close to 0). To support this claim, see Table [2] 

So, let us consider the following CR Algorithm (standing for Cross rewiring ), 
taking p(k), p'(k), N and a as input: 
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CR Algorithm. 

(CR1) Use the configuration model to get two random networks H( 0), H'{ 0) of size 
A and degree distributions p(k),p'(k). Sort increasingly the respective de¬ 
gree sequences (Aq, ..., Aqv) and (k[,... ,k’ N ). This corresponds to relabeling 
the nodes of both H( 0), H'{ 0) to get two networks G(0), G'(0) isomorphic 
to H( 0), H'{ 0) respectively in such a way that ki < k 3 and k[ < fc' whenever 
i < j. The overlap between G(0) and G'(0) is close to 0. 

At each time step t > 0: 

(CR2) Choose at random (if it exists) a good pair of links in G(t) with respect 
to G'(t). Perform a cross rewiring operation in G(t) using such a pair, 
obtaining a new network G(t + 1). Set G'(t + 1) := G'(t). Then, by 
Ov(G(t + 1), G'(t + 1)) > Ov(G(t), G'(t)). If Ov(G(t + 1), G'(t + 1)) > a, 
set G := G(t + 1), G' := G'(t + 1) and stop. Otherwise, proceed to the next 
time step. 

It is clear that after a finite number to of steps the algorithm will stop, either 
because no good pairs are found or because the overlap between G(ffi) an d G'(to) 
is very close to a. In any case, the output of the algorithm is the pair of networks 
G(to), G’(to). It is also clear that the algorithm admits some variants. For instance, 
the cross rewiring operations can be performed also over good pairs in G'(t) with 
respect to G(t). A natural question is whether in general the algorithm may halt 
forced by the condition that no good pairs are found, before having reached a value 
of the overlap close to a. This question can be reworded as follows: does the 
algorithm produce a value of the overlap coefficient close to MaxO vn(p,p') when 
we execute it with a = MaxOvjv(p,p')? (observe that in this case the algorithm 
will stop if and only if no good pairs are found). In Tabic [3] we show the maximum 
overlap generated using the CR Algorithm for several pairs of distributions, together 
with the upper bounds computed via the Theorem 14.41 In all cases, the obtained 
overlap is reasonably close to the theoretical maximum. 

6. Simulations 

We have performed a series of stochastic simulations with pairs of networks of 
size A = 10000. In order to evaluate the accuracy of the analytical predictions 
depending on the network structure, we have chosen several (theoretical) degree 
distributions p(k),p'(k) for each layer. Once the size A and the respective distri¬ 
butions p(k) and p'(k) are chosen, we proceed as follows: 

(1) Generate two random networks Aq and Bq with empirical degree distri¬ 
butions PA{k) ss p{k) and ps(fc) « p’(k) using the standard configuration 
model. 

(2) Use the corresponding degree sets and Theorem 14.41 to estimate the max¬ 
imum overlap coefficient a max (between any two networks distributed ac¬ 
cording to pA(k),p B (k)). 

Now we are ready to test the relevance of the layer overlap as a model parameter 
by choosing several values of a in the range (0,a max )- For any of such values, we 
use the CR Algorithm to construct two networks A a , B ai distributed according to 
PA(k),pB(k ), with an overlap coefficient very close to a. With these ingredients 
we can simulate, using the standard Gillespie algorithm [T5], the stochastic time 
evolution of the infection spread. In each case the initial number of infected nodes 
is set to 1000 (10% of the population size). The infected individuals are drawn from 
the whole population with the same probability 1/A. In fact, for each pair A a , B a 
we run 10 simulations with 10 different initial sets of infected nodes in order to 
average the outputs. 
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Regular 

Poisson 

SF 

Exponential 


1 

0.739020 

0.564752 

0.448772 

Regular 

1 

0.7761 

0.6112 

0.5004 

Poisson 


0.993035 

0.994325 

0.654052 

0.7180 

0.583859 

0.6345 

SF 



0.97987 

0.98575 

0.665794 

0.7095 


Table 3. Maximum overlap generated using the CR Algorithm 
(first row) vs the maximum value permitted by Theorem 14.41 (sec¬ 
ond row). In all cases, N = 10000 and (, k) = 10. 


What we compare with the simulation outputs is the numerical integration of 
the model m, feeding it with the empirical degree distribution pA(k). We use 
the empirical distribution instead of the theoretical one p{k) because, when the 
variance of p(k) is large (highly heterogeneous networks), there can be noticeable 
differences among distinct finite samples of p(fc), in particular with respect to the 
values of the highest degrees which, as we will see, have a noticeable impact on 
the epidemic dynamics. To avoid degree-degree correlations within a layer due 
to the occurrence of very high degrees in the generated degree sequence, we have 
normalized the power-law distribution p{k) = Ck ~ 3 to have a minimum degree 
to and a maximum degree given by the cut-off k c (N) = mN 1 ' 2 , defined as the 
value of the degree above which one expects to find at most one node in the whole 
network. This expression of k c (N ) coincides with the so-called structural cut-off for 
this exponent of the power law (see 0), and leads to the normalization constant 
C = (7 — l)m' y ~ 1 N/(N — 1) and an expected degree ( k) = 2mN/(N — 1) ss 2 to. 

As initial condition ifc(0) to integrate (U) and according to the procedure in 
the stochastic simulations, we consider that the same fraction of susceptible nodes 
becomes infected for any degree k. In particular, we take *fc(0) = 0.1 PA{k) for all 
k , which amounts to a 10 % of initially infected nodes. 

There is however a crucial remark on the simulation experiments. Recall that the 
lack of degree-degree correlations inside each layer was a basic assumption in the 
derivation of ©■ Therefore, to asses the goodness of the model we must make sure 
that all pairs of networks A a ,B a used in our simulations satisfy this assumption. 
It is reasonable to expect that the pairs of networks created via the CR Algorithm 
are uncorrelated, since: 

(1) The initial networks G(0), G'(0) are randomly generated via the configura¬ 
tion model algorithm, which is known to produce uncorrelated networks. 

(2) A cross rewiring operation in a good pair of links {a, b}, {c,d} increases 
(respectively, decreases) the global degree-degree correlation if the new links 
connect the two nodes with the smaller degrees and the two nodes with 
the larger degrees (respectively, if one of the new links connects the node 
with the largest degree to the node with lowest degree). But the rewiring 
criterion in the CR Algorithm is intended to increase the overlap coefficient 
and has nothing to do with the degrees of the four involved nodes. So, some 
reconnections will increase the global degree-degree correlation and some 
will decrease it, thus expecting an overall balance. 

To support this claim, we show in Table@]the standard Pearson coefficient r for each 
layer, computed from the two random variables defined by the degrees of the nodes 
at both ends of randomly chosen links [25]. Values of r close to —1 (respectively 
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a = 0.2 

a = 0.4 

a = 0.6 

Poisson 

SF 

-0.003353 

0.016494 

-0.003353 

0.057019 

0.022467 

0.085278 

SF 

Exponential 

-0.007762 

-0.010041 

-0.007762 

-0.070960 

-0.007762 

-0.070971 

Poisson 

Exponential 

-0.003353 

0.040715 

-0.003353 

0.088316 

-0.003353 

0.139976 


Table 4. Pearson coefficient to measure the degree-degree corre¬ 
lations in each layer for several pairs of networks obtained from 
the CR Algorithm. In all cases, N = 10000 and (k) = 10. 


1 ) account for dissortative (resp. assortative) networks, while values close to 0 
correspond to uncorrelated networks. 

We have performed two series of experiments addressed to illustrate the influence 
of the two factors appearing in ([3]), both of which are related to the topology of 
the layers. The first factor accounts for the difference in link density between both 
layers (measured by the ratio of their mean degrees), whereas the second one is an 
increasing function of the overlap coefficient a. First, we consider that both layers 
have exponential degree distributions but different minimum degrees and, hence, 
different mean degrees, (/ca) and (fcg). Figure [5] shows, for a = 0.1 (left panels) 
and 0.6 (right panels), the prevalence of the epidemic when (/ca) = 30 > (/eg) = 20 
(top panels) and when (/ca) = 20 < (ks) = 30 (bottom panels). From this figure 
it follows that the epidemic will be better contained when (an important part of) 
layer A can be embedded in layer B, which is only possible when (/ca) < (/cb). Such 
an embedding is clearly not possible when the number of links is much larger in 
layer A than in layer B ((/ca) > (/c_b))- 

Next, we compare network layers with the same expected number of links (same 
mean degrees) but different network topologies. The aim is to see how a non- 
uniform overlap makes the epidemic dynamics depart from the model predictions. 
Keeping the same heterogeneous degree distribution in layer A, we vary the degree 
heterogeneity in layer B by considering Poisson, exponential and power-law degree 
distributions. To make the differences more noticeable, we take a parameter com¬ 
bination leading to epidemic extinction according to model ©• As Figure [3] shows, 
when the variance of degrees in layer B is low (top panel) the nodes with the highest 
degrees in layer A have a much lower fraction of common links than those with low 
degrees once the CR algorithm has been applied. This means the violation the hy¬ 
pothesis of a uniform overlap between layers, and allows a higher transmission of the 
infection which leads to a (low) prevalence of the disease, instead of the epidemic 
die-out predicted by the model. As the variance of degrees in layer B increases 
(middle and bottom panels), the disagreement between simulations and the model 
prediction decreases. In fact, in the bottom panel the epidemic extinction is also 
observed in the simulations because layer B has a degree sequence generated from 
the same power-law distribution as the one used to generate the degree sequence 
of layer A and, hence, a higher uniformity in the overlap is achieved. 

7. Discussion 

We have proposed a cross-rewiring algorithm to create and control the overlap 
between two networks with prescribed degree sets. The wider range of permitted 
overlap coefficients, from 0 to values very close to the theoretical upper bound 
given by Theorem 14.41 is obtained by cross-rewiring networks whose nodes have 
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time (infectious periods) 


Figure 2. Fraction of infectious nodes averaged over 10 runs of 
stochastic simulations carried out on two-layered exponential ran¬ 
dom networks of size N = 10000 for a = 0.1 (left panels) and a = 
0.6 (right panels). Top panels: (Au) = 30 and (ks) = 20. Bottom 
panels: (fc^) = 20 and (ks) = 30. Dashed line shows the preva¬ 
lence () >2 k ik) predicted by the SIS model ([2]). Initial fraction of 
infected nodes: 10%. Parameters: fj, = 1, /3 = 0.1, /3 C = 0.005. 


been labelled according to their rank in the ordered degree sequences, suggesting 
that the overlap coefficient and the inter-layer degree-degree correlation can be 
quite independent from each other. This algorithm allows to check the predictions 
of a mean-field SIS model with awareness dissemination in a host population, where 
the routes of propagation for the infectious agent and awareness are embedded into 
a two-layer network. 

A key ingredient of the model is the probability Pb\a that a randomly chosen link 
of layer A connects two nodes that are also connected in layer B , i.e., the probability 
that an A-link is a common link. Its expression, given by ©. shows that, as one 
could expect, it increases with the overlap between the layers but, moreover, it 
is also a linear increasing function of the ratio (Ate)/(Aui), which measures the 
difference in the number of links of each layer, La and Lb- In particular, if (fc^) > 
(fcs), Lemma [4.11 savs that a < (fee)/(A’ a) = Lb/La, and, so, Pb\a < Lb/La- 
This inequality simply reflects the fact that layer A cannot be embedded in layer B 
(since Pb\a < !)• Conversely, if (fc^) < (ks), then a < (kA)/{kB ), and © implies 
that Pb\a A 1- This result agrees with what one would expect from the definition of 
Pb\a because, when all the A-links are common links, Pb\a = 1 even for a < 1, but 
such an embedding is only possible when La A Lb- The effect of these asymmetric 
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0.2 



0 6 12 18 24 30 

time (infectious periods) 

Figure 3. Fraction of infectious nodes averaged over 10 runs of 
stochastic simulations carried out on a two-layered network of size 
N = 10000 for a = 0.6. In all cases layer A has a power-law degree 
distribution ( p(k ) ~ k~ 3 ) with k m = 5 ((/ca) = 10), whereas 
layer B has Poisson (top), exponential (middle) and power-law 
(bottom) degree distribution with the same mean degree ((&b) = 
10). Dashed line shows the prevalence (%2 k ik) predicted by the 
SIS model d3|. Initial fraction of infected nodes: 10%. Parameters: 
fi = 1, P = 0.1, and /3 C = 0.005. 


roles played by each mean degree on the epidemic progression is illustrated in Fig. [2] 
for networks with the same type of degree distribution but different mean degrees. 
Clearly, in this example the epidemic spread is only contained when the mean degree 
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of the layer B , over which awareness dissemination occurs, is higher than that of 
layer A and, moreover, the overlap between layers is high enough (right bottom 
panel). 

A basic assumption in the derivation of the model is the uniform distribution of 
the overlap over the set of nodes. This means that those nodes with high degrees in 
layer A have the same fraction of overlapped links that those with lower degrees. Of 
course, this will not be the case when there is a large asymmetry between the degree 
distributions of each layer. One can observe the differences when layer A, the one 
over which physical contacts occur, has a power-law degree distribution whereas 
dissemination layer B has a Poisson degree distribution. When both degree distri¬ 
butions have the same mean degree, those nodes with the highest degrees in layer 
A only have a small fraction of overlapped links because of the low variance of the 
Poisson distribution. This amounts to an underestimation of the epidemic preva¬ 
lence by the mean-field SIS model d4j) since those nodes acting as a superspreaders 
in layer A have proportionally much less contacts with a low transmission rate (see 
top panel in Fig. [3]). In contrast, by increasing the variance of the degree distribu¬ 
tion of layer B 1 disease transmission is reduced and the epidemic evolution is closer 
to the one predicted by the the mean-field model (see bottom panel in Fig. [3] where 
layer B has the same power-law degree distribution as layer A). 

In general, when the mean-field assumptions are met, stochastic simulations con¬ 
firm that the proposed SIS model is suitable for modelling two interacting conta¬ 
gious processes like epidemic spreading and awareness dissemination. In particular, 
due to the nature of their interaction, the model predicts a decreasing relationship 
between R 0 and the overlap coefficient a (see Fig. [[]). Moreover, although the an¬ 
alytical prediction of the mean-field model is not accurate close to the epidemic 
threshold i?o(c*0 = 1, the behaviour of the prevalence with network overlap shows 
a good agreement with stochastic simulations when the overlap coefficient is not 
so close to its critical value. With this respect, it would be interesting to consider 
which relationships between overlap and epidemic thresholds follow for more gen¬ 
eral epidemic network models as those considered in (JO) [29) [31], which are based 
on the adjacency matrix of each network layer and allow for both degree-degree 
correlations within and between layers, and a non-uniform overlap between layers. 

Finally, note that a similar mean-field approach for modelling epidemic spreading 
on single heterogeneous networks was adopted in using, as state variable, the 

fraction pk of nodes of degree k that are infectious. The connection between this 
approach and the one traditionally used in epidemiology is given by the relation¬ 
ship between the state variables, namely, ik = h/N = Ik/Nk ■ Nk/N = pkp(k) (see 
|23|). These works were more focussed on aspects of network topology and, in par¬ 
ticular, the absence of epidemic threshold was proved in [J] for scale-free networks 
with degree-degree correlations, i.e., for networks with a mixing pattern such that 
P(k'\k) ^ k'p{k')/(k ). Such a network-oriented approach offers an alternative way 
for analysing the impact of overlap on epidemic spreading on two-layer networks 
with non-proportionate mixing within each layer (see [52] for an extension of this 
formalism to interconnected networks). With this respect, the cross-rewiring algo¬ 
rithm used to generate overlapped networks with arbitrary degree distributions can 
be adapted to control the intra-layer degree-degree correlation during the process. 
This network attribute, however, will restrict the value of the maximum attain¬ 
able overlap coefficient because it reduces the number of ’’good pairs” as long as 
correlations within each layer are preserved. Indeed, the dependence between cor¬ 
relations and the maximum attainable overlap constitutes an interesting topic for 
future work. 
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